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CRYSTTU^ STRUCTURE OF PANTOTHENATE SYNTHETASE 

Field of the Invention 

The present invention relates to the enzyme pantothenate, 
synthetase, and in particular its crystal structure and the 
use of this structure in drug discovery. 

Background of the Invention 

Pantothenic acid (vitamin B5) is found in coenzyme A (CoA) and 
the acyl carrier protein • (ACP) ^ both of which are involved in 
fatty acid metabolism. 

Pantothenic acid can be synthesised by plants and 
microorganisms but animals are apparently unable to make the 
vitamin, and require it in their diet. However, all organisms 
are able to convert pantothenic acid to its metabolically 
active form, coenzyme A. 

The pathway for the synthesis of pantothenic acid is shown in 
Fig. 1. It provides a potential target for the treatment of 
infectious disease, since inhibitors of the pathway should be 
damaging -to bacteria and. fungi, but. not „to ..human or animal„ .. .. ... . 

subjects infected by bacteria. 

Of specific interest is pantothenate synthetase (D-pantoat.e : 
p--alanine ligase (AMP-f orming) ; EC 6.3.2,1). This enzyme 
catalyses the condensation between p-alanine and pantoic acid, 
the final steps in pantothenic acid biosynthesis. Inhibitors 
(whether competitive, non-competitive, uncompetitive or . 
irreversible) of pantothenate synthetase would be of 
significant technical and commercial interest. 
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Purification of pantothenate synthetase (PS) to homogeneity 
was achieved by Miyatake et, al, (J. Blochem. , 79, . (1976), 
673-678) . The enzyme was reported to require stoichiometric 
amounts of ATP as an energy source which is hydrolysed to AMP 
5 and inorganic pyrophosphate. The mechanism of the enzymic 
reaction involves pantoate adenylate as an intermediate. 

However until now no one has successfully determined the 
structure of PS. This has prevented PS inhibitors being 
10 developed via structure-based drug design methodologies. 

Knowledge of the structure of PS would significantly assist 
the rational design of novel therapeutics based on PS 
inhibitors . 

15 Summary of the Invention 

The present invention is at least partly based on overcoming x 
several technical hurdles: we have (i) produced PS crystals of 
suitable quality, including crystals of selenium atom PS 

20 derivatives, for performing X-ray diffraction analyses, (ii) 
collected X-ray diffraction data from the crystals, (iii) 
determined a high resolution three-dimensional X-ray crystal 
structure of PS (i.e. a structure having a resolution which is 
numerically lower than 2 A) , and (iv) identified sites on the 

25 enzyme which are likely to be involved in the enzymic 
reaction. 

In general aspects, the present invention is concerned with 
identifying or obtaining agent compounds (especially 
30 inhibitors of PS) for modulating PS activity, and. in preferred 
embodiments identifying or obtaining actual agent 
compounds /inhibitors. Crystal structure information presented 
herein is useful in designing potential inhibitors and 
modelling them or their potential interaction with the PS 
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binding cavity. Potential inhibitors may be brought into 
contact with PS to test for ability to interact with the PS 
binding cavity. Actual inhibitors may be identified from 
among potential inhibitors synthesized following design and 
model work performed in silico. An inhibitor identified using 
the present 'invention may be formulated into a composition, 
for instance a composition comprising a pharmaceutically 
acceptable excipient, and may be used in the manufacture of a 
medicament for use in a method of treatment. These and other 
aspects and embodiments of the present invention are discussed 
below. 

In a first aspect, the present invention provides a crystal , of 
PS having a monoclinic space group . P2i/ and unit cell 
dimensions of a = 66.0±0.2A, b = 78.1±0-2A, c = 77.1+0.2A anc*;. 
p = 103.7+0.2**. Preferably the PS is a dimer. 

In a further aspect, the invention also provides a. crystal of 
PS having the three dimensional atomic coordinates of . Table -.1. 
An advantageous feature of the structure defined by the atomic 
coordinates is theft it has a high resolution.,, of about 1.7A* / 

The coordinates of Table 1 provide a measure of atomic 
location in Angstroms, to a third decimal place. The 
coordinates are a relative set of positions that define a 
shape in three dimensions, but the skilled person would 
understand that an entirely different set. of coordinates 
having a different origin and/or axes could define a similar 
or identical shape. Furthermore, the skilled person would 
understand that varying the relative atomic positions of the 
atoms of the structure so that the root mean square deviation 
of the residue baclcbone atoms (i.e. the nitrogen-carbon-carbon 
backbone atoms of the protein amino acid residues) is less 
than 1.5 A (preferably less than 1.0 A and more preferably 
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less than 0.5 A) when superimposed on the coordinates provided 
in Table 1 for the residue backbone atoms, will generally 
result in a structure which is substantially the same as the 
structure of Table 1 in terms of both its structural 
5 characteristics and potency for structure-based design of PS 

inhibitors. Likewise the skilled person would understand that 
changing the number and/or positions of the water molecules 
and/or substrate molecules of Table 1 will not generally 
affect the potency of the structure for structure-based design 

10 of PS inhibitors- Thus for the purposes described herein as 

being aspects of the present invention, it is within the scope 
of the invention if: the Table 1 coordinates are transposed to 
a different origin and/or axes; the relative atomic positions 
of the atoms of the structure are varied so that the root mean 

15 square deviation of residue backbone atoms is less than 1.5 A 
(preferably less than 1.0 A and more preferably less than 0.5 
A) when superimposed on the coordinates provided in Table 1 
for the residue backbone atoms; and/or the number and/or 
positions of water molecules and/or substrate molecules is 

20 varied. Reference herein to the coordinate data of Table 1 
thus includes the coordinate data in which one or more 
individual values of the Table are varied in this way- By 
^^root mean square deviation" we mean the square root of the 
arithmetic mean of the squares of the deviations from the 

25 mean. 

Thus, for example, varying the atomic positions of the atoms 
of .the structure by up to about 0.2 A in any direction will 
result in a structure which is substantially the same as the 
30 structure of Table 1 in terms of both its structural 

characteristics and utility e.g. for structure-based drug 
design. 

The provision of the high resolution structure of Table 1 
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provides those of skill in the art with a detailed insight 
into the mechanisms of action of PS. This insight provides a 
means to design new antibacterial agents which have the 
potential to inhibit the process of pantothenate synthesis in 
bacteria and fungi, or to modulate the activity of the enzyme, 
for example such that the enzyme works more effectively on 
prodrugs which are converted by PS into an antibacterial drug. 

In a further aspect, the invention provides a method for 
crystallizing a selenium atom PS derivative which comprises 
producing PS by recombinant production in a bacterial host . 
(e.g. JS. coll) in the presence of selenomethionine , recovering 
a selenium atom PS derivative from the host and growing ...-.-^r;:, 
crystals from the recovered selenium atom PS derivative. 

Thus, the selenium atom PS derivative and PS produced by 
crystallising native PS (see the detailed description below),; 
are provided as crystallised proteins suitable for X-ray 
diffraction analysis. 

The crystals may be grown by any suitable method;^ . e.g. the . 
hanging drop method. 

The above aspects of the invention, both singly and in 
combination, all contribute to features of the invention which 
are advantageous. 

The provision of the crystal structure of PS allows a novel 
approach for drug discovery for modulators of this enzyme. 
Accordingly, the invention provides a computer-based method of 
rational drug design which comprises: 

providing the structure of the PS as defined by the 
coordinates of Table 1; 

providing the structure of a candidate modulator molecule; 
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and 

fitting the structure of the candidate modulator molecule to 
the structure of the PS of Table 1. 

5 In an alternative aspect^ the method of the invention may utilise 
the coordinates of atoms of interest of the PS which are in the 
vicinity of a putative substrate and/or co-factor binding regions 
in order to model the pocket in which the substrate or co-factor 
binds. These coordinates may be used to define a space which is 

10 then screened ^^in sillco^'^ against a candidate modulator molecule. 
Thus the invention provides a computer-based method of rational 
drug design which comprises: 

providing the coordinates of at least . two . atoms of the PS of 
Table 1 (''^selected coordinates") ; 

15 providing the structure of a candidate modulator molecule; 

and 

fitting the structure of the candidate modulator molecule to 
the selected coordinates of the PS. 

20 In practice, it will be desirable to model a sufficient number of 
atoms of the PS as defined by the coordinates of Table 1 which 
represent a binding pocket. Binding pockets and other features 
of the interaction of PS with co-factor are described in the 
detailed description. Thus, in this embodiment of the invention, 

25 there will preferably be provided the coordinates of at least 5, 
preferably at least 10, more preferably at least 50 and even more 
prefersLbly at least 100 selected atoms of the PS structure. 

Our structure of PS has allowed us to identify particular sites 
30 of interaction of Mg^***^ ATP, pantoate and p-alanine. The 
selected coordinates preferably include at least one of the 
coordinates defining these particular sites. Residues providing 
some of these sites include Pro28, Met30, His34, Asp35, Gly36, 
His37, Leu40, Asn58, Gln61, Phe62, TyrTl, Argl23, Hisl26, Ilel33, 
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Vall34, Leul37, LyslSl, Glnl55, Metl78, Alal85, Leul86, Serl87, 
Series, and Argl8 9. 

As discussed in the detailed description, we believe that ATP 
interacts with one or more of Met30, His34, Asp35, G1y36, His37, 
Leu40, LyslSl, Metl78, Alal85/ LeulB6, Serl87., Serl88, and 
Argl89; Mg^"^ interacts with either or both of Tyr71 and Serl88; 
pantoate interacts with one or more of Pro28, Met30, Asn58, 
Gln61, Ilel33, Vall34, Leul37, and Glnl55; and (3-alanine 
interacts with one or more of MetSO, Phe62, Tyr71, Argl23, and 
Hisl26. 

In another aspect, the method of the invention may utilise a :S.\ib- 
domain of interest of the PS which is in the vicinity of a region 
which binds substrate or co- factor . Thus, the . invention provides**" 
a computer-based method of rational drug design which comprises : 
providing the coordinates of at least a sub-domain of the 

PS; ■ 

providing the structure of a candidate modulator molecule; 

and . 

fitting the structure of the candidate modulator molecule . 
to the coordinates of the PS sub-domain provided. 

Brief Description of the Drawings 

Fig • 1 shows schematically the pathway for the synthesis of 
pantothenic acid, ' 
Figs. 2a-c show the general structure of PS, being 
respectively (a) a ^^cartoon" of the dimer, (b) a schematic 
diagram of the monomer topology with numbering of secondary 
strxictures, and (c) a schematiq plot of hydrogen bonding 
patterns between .secondary structures. 

Fig. 3 is a stereo pair of images showing schematically the 
core of the dimerisation interface, and 
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Fig. 4 shows a Connolly surface generated around the proposed 
PS active sites. 



Detailed Description of the Invention 

5 

By ^'fitting", it is meant determining by automatic^ or semi- 
automatic means ^ interactions between at least one atom of the 
candidate and at least one atom of the PS, and calculating the 
extent to which such an interaction is stable. Interactions 
10 include attraction and repulsion, brought about by charge, 
steric considerations and the like. Various computer-based 
methods for fitting are described further herein. 



By ^'sub-domain" is meant at least one (e.g. one, two, three or 
15 four) complete element (s) of secondary structure, i.e. an 
alpha helix or a beta sheet, as described in the detailed 
description below. 

Table 1 gives atomic coordinate data for PS (which we have 
20 crystallised as a dimer) and associated water molecules. In 
Table 1 ''Atom type" refers to the respective element, the first 
letter defining the element; '^X, Y, Z" define, with respect to 
the crystallographic axes, the atomic position (in A) of the 
respective atom; '"Occ." is the occupancy . of the atom in the 
25 respective position; and ''B" is a temperature factor (in A^) 
which accounts for movement of the atom around its atomic centre. 

Particular regions of the PS include those identified as putative 
substrate or cofactor binding regions based on the data provided 
30 in Table 1. 



As indicated above,, modulators of PS may be inhibitors of the 
enzyme or compounds which affect its specificity or activity 
in relation to pantoate in other ways. The invention is 
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particularly suitable for the design, screening and 
development of PS inhibitor • components . It is thus a 
preferred aspect of the invention that modulators are 
inhibitors . 

The step of providing the structure of a candidate modulator 
molecule may involve selecting the compound by computationally 
screening a database of compounds for interaction with the 
active site. For example, a 3~D descriptor for the potential 
modulator may be derived, the descriptor including geometric 
and functional constraints derived from the architecture and 
chemical nature of the active site. The descriptor may then 
be used to interrogate the compound database, a potential 
modulator being a compound that has a good match to the ^{ 
features of the descriptor. In effect, the descriptor is ar'-r-^- 
type of virtual pharmacophore. 

In any event, the determination of the three-dimensional 
striucture of PS provides a basis for the design of new and . . 
specific ligands for PS.. For example, knowing the three- 
dimensional structure of PS, computer modelling programs may 
be used to design different molecules expected to interact 
with possible or confirmed active sites, such as binding sites 
or other structural or functional features of PS. 

More specifically, a potential modulator of PS activity can be 
examined through the use of computer modelling using a docking 
program such as GRAM, DOCK, or AUTODOCK (see Walters et al.. 
Drug Discovery Today, Vol.3, No. 4, (1998), 160-178, and 
Dunbrack et al.. Folding and Design, 2, (1997), 27-42) to 
identify potential inhibitors of PS. This procedure can 
include computer fitting of potential inhibitors to PS to 
ascertain how well the shape and the chemical structure of the 
potential inhibitor will bind to the enzyme. 
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Also computer-assisted, manual examination of the active site 
structure of PS may be performed. The use of programs such as 
GRID (Goodford, J. Med. Chem. , 28, (1985), 849-857) - a 
5 program that deteirmines probable interaction sites between 
molecules with various functional gjroups and the enzyme 
surface - may also be used to analyse the active site to 
predict partial structures of inhibiting compounds. 

10 Computer programs can be employed to estimate the attraction, 
repulsion, and steric hindrance of the two binding partners 
(e.g. the PS and a potential inhibitor) . Generally the 
tighter the fit, the fewer the steric hindrances, and the 
greater the attractive forces, the more potent the potential 

15 modulator, since these properties are consistent with a 

tighter binding constant- Furthermore, the more specificity 
in the design of a potential drug, the more likely it is that 
the drug will not interact with other proteins as well. This 
will tend to minimise potential side-effects due to unwanted 

20 interactions with other proteins. 

In a further aspect, the present invention provides . a method 
for identifying a candidate modulator (e.g. potential 
inhibitor) of PS comprising the steps of: 

25 employing a three-dimensional structure of PS, or at 

least one sub-domain thereof, , to characterise at least one PS 
active site, the three-dimensional structure being defined by 
atomic coordinate data according to Table 1; and 

identifying the candidate modulator by designing or 

30 selecting a compound for interaction with the active site. 

If more than one PS active site is characterised and a 
plurality of respective compounds are designed or selected, 
the modulator may formed by linking the respective compounds 
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into a larger compound which maintains the relative positions 
and orientations of the respective compounds at the active 
sites . The larger compound may be formed as a real molecule 
or by computer modelling. 

The step of identifying the candidate modulator may involve 
selecting the compound by computationally screening a database 
of compounds for interaction with the active site. 

In another aspect, in place of In sllico methods, high 
throughput screening of compounds to select compounds with 
binding activity may be undertaken, and those compounds which 
show binding activity may be selected as possible candidatej^^:.-:. 
modulators, and further crystallized with PS (e.g. by co- a. 
crystallization or by soaking) for X-ray analysis. The 
resulting X-ray structure may be compared with that, of Tableii.! 
for a variety of purposes. For example, where the contacts 
itiade" by such compounds overlap with those may by pan toate , T:: 
novel molecules comprising residues which contain contacts o.f 
both pantoate and the other compound may be provided. 

Having designed or selected possible binding candidate 
modulators by determining those which have favourable fitting 
properties (e.g. strong attraction between candidate and PS), 
these can then be screened for activity. Consequently, the 
method preferably further comprises the steps of: 

obtaining or synthesising the candidate modulator; and 
contacting the candidate modulator with PS to determine 
. the ability of the candidate modulator to interact with PS . 

More preferably, in the latter step the candidate modulator is 
contacted with PS under conditions to determine its function . 

For example, in the contacting step above the candidate 
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modulator is contacted with PS in the presence of a . substrate, 
and typically a buffer, to determine the ability of said 
candidate modulator to inhibit PS. The s\ibstrate may be e.g. 
pantoic acid (or a salt thereof) , p-alanine (or a salt 
5 thereof) , or ATP. So., for example, an assay mixture for PS 
may be produced which comprises the candidate modulator, 
substrate and buffer. 

Instead of, or in addition to, performing such an assay, the 
10 method may comprise the further steps of: 

obtaining or synthesising said candidate modulator; 
forming a complex of PS and said candidate modulator; and 
analysing said complex. by X-ray crystallography to 
determine the ability of said candidate modulator to interact 
15 with PS. Detailed structural information can then be obtained 
about the binding of the candidate modulator to PS, and in the 
light of this information adjustments can be made to the 
structure or functionality of the candidate modulator, e.g. to 
improve binding to the active site. The above steps may be 
20 . repeated and re-repeated as necessary. 

In another aspect, the invention relates to a method of 
determining three dimensional structures of PS homologues of 
unknown structure by utilising the structural coordinates of 
25 Table 1. 

For example, if X-ray crystallographic or NMR spectroscopic 
data is provided for a PS homologue of unknown structure, the 
structure of PS as defined by Table 1 may be used to interpret 
30 that data to provide a likely structure for the PS homologue 
by techniques which are well known in the art, e.g. phase 
modelling in the case of X-ray crystallography. 

One embodiment of the method comprises the steps of: 
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(a) aligning a representation of an aihino acid sequence 
of a PS horaologue of unknown structure with the amino acid 
sequence of PS to match homologous regions of the amino acid 
sequences ; 

(b) modelling the structure of the matched homologous 
regions of the PS of unknown structure on the structure as 
defined by Table 1 of the corresponding regions of PS; and 

(c) determining a conformation (e,g- so that favourable 
interactions are formed within the PS of unknown structure 
and/or so that a low energy conformation is formed) for the PS 
of unknown structure which substantially preserves the 
structure of said matched homologous regions . . ^ 



The term ^^homologous regions" describes amino acid residues; in.,- 
two sequences that are identical or have similar (e.g. :^!;..^^^^^ 
aliphatic, aromatic, polar, negatively charged, or positively 
charged) side-chain chemical groups. Identical and similar 
residues - in homologous regions are sometimes described as . i 
being respectively ^^invariant" and ^'conserved" by those 
skilled in the art. 

Preferably one or all of steps (a) to (cj are performed by 
computer modelling. Homology modelling is a technique that is 
well known to those skilled in the art (see e.g. Greer, 
Science, Vol. 228, (198.5), 1055, and Blundell et al., Eur. J. 
Blocbem, Vol. 172, (1988), 513).. 

In general, comparison of amino acid sequences is accomplished 
by aligning the amino acid sequence of a polypeptide of a 
known structure with the amino acid sequence of the 
polypeptide of unknown structure. Amino acids in the 
sequences are then compared and groups of amino acids that are 
homologous are grouped together. This method detects 
conserved regions of the polypeptides and accounts for amino 
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acid insertions or deletions. 

Homology between amino acid sequences can be determined using 
commercially available algorithms. The programs BLAST, gapped 
5 BLAST, BLASTN and PSI-BLAST (provided by the National Center 
for Biotechnology Information) are widely used in the art for 
this purpose, and can align homologous regions of two amino 
acid sequences. 

10 Once the amino acid sequences of the polypeptides with known 
and unknown structures are aligned, the structures of the 
conserved amino acids in a computer representation of the 
polypeptide with known structure are transferred to the 
corresponding amino acids of the polypeptide whose structure. 

15 is unknown. For example, a tyrosine in the amino acid 
sequence of known structure may be replaced by a 
phenylalanine, the corresponding homologous amino acid in the 
amino acid sequence of unknown structure. 

20 The structures of amino acids located in non-conserved regions 
may be assigned manually by using standard peptide geometries 
.or by molecular simulation techniques, such as molecular 
dynamics. The final step in the process is accomplished by 
refining the entire structure using molecular dynamics and/or 

25 energy minimization. 

The aspects of the invention described herein which utilise 
the PS structure in sillco may be equally applied to hbmologue 
models of PS obtained by the above aspect of the invention, 
30 and this application forms a further aspect of the present 

invention. Thus having determined a conformation of a PS by 
the method described above, such a conformation may be used in 
a computer-based method of rational drug design as described 
herein . 
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In another aspect, the invention includes a compound, which is 
identified as a modulator (preferably an inhibitor) of PS by 
the methods of the invention described above. 

Following identification of an inhibitor compound, it may be 
manufactured and/or used in the preparation, i.e. manufacture 
or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered 
to individuals . 

Thus, the present invention extends in various aspects not 
only to an inhibitor as provided by the invention, but alsoj^^a,/ 
pharmaceutical composition, medicament, drug or other ^ 
composition comprising such an inhibitor e.g. for treatment- 
(which may include preventative treatment) of disease such as 
microbial infection; a method comprising administration of 
such a composition to a patient, . e.g. for treatment of diseajse. 
such as microbial, infection; use of such an inhibitor in the 
manufacture of a composition for administration, e.g, for 
treatment of disease such as microbial infection;,^, and a method 
of making a pharmaceutical composition comprising admixing 
such an inhibitor with a pharmaceutically acceptable 
excipient, vehicle or carrier, and optionally other 
ingredients. 

In a further aspect, the invention provides a method for 
determining the structure of a modulator of PS bound to. PS, 
said method comprising: 

providing a crystal of PS according to the invention; 
soaking the crystal with said modulator; and 
determining the structure of said PS-modulator complex. 

Alternatively, the PS and modulator may be co-crystallized. 
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In either case, pantoate, p-alanine and/or pantothenate or an 
analogue thereof may optionally be present. 

Having obtained and characterized a modulator compound 
5 according to the invention, the invention further provides a 
method for modulating the activity of PS which method 
comprises: 

providing PS under conditions where, in the absence of 
modulator, the PS is able to synthesize pantothenate from 
10 pantoate; 

providing a modulator compound; and 

determining the extent to which the activity of PS is 
altered by the presence of said compound. 

15 In another aspect, the present invention provides systems, 
particularly a computer system, intended to generate 
structures and/or perform rational drug design for PS or a 
complex of PS and a potential modulator, the systems 
containing either (a) atomic coordinate data according to 

20 Table 1, said data defining the three-dimensional structure of 
PS or at least one sub-domain thereof, or (b) structure factor 
data for PS, said structure factor data being derivable from 
the atomic coordinate data of Table 1. 

25 In a further aspect, the present invention provides computer 
readable media with either (a) atomic coordinate data 
according to Table 1 recorded thereon, said data defining the 
three-dimensional structure of PS, at least one atom or at 
least one sub--domain thereof, or (b) structure factor data for 

30 PS recorded thereon, the structure factor data being derivable 
from the atomic coordinate data of Table 1. 

As used herein, ^^computer readable media" refers to any medium 
or media which can be read and accessed directly by a 
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computer. Such media include, but are not limited to: 
magnetic storage media such as floppy discs, hard disc storage 
medium and magnetic tape; optical storage media such as 
optical discs or CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as 
magnetic/optical storage media. 

By providing such computer readable media, the atomic 
coordinate data can be routinely accessed to model PS or a 
sub-domain thereof. For example, RASMOL (Sayle et al . , TIBS, 
Vol. 20, (1995), 374) is a publicly available computer 
software package which allows access and analysis of atomic-, 
coordinate data for structure determination and/ or rationai::%. . 
drug design. 

On the other hand, structure factor data, which are derivable 
from atomic coordinate data (see e.g. Blundell et al., in .r. 
Protein Crystallography, Academic Press, TSIew York, London and 
San ..Francisco, (1976) ), are particialarly useful for 
calculating e.g. difference . Fourier electron density, maps. 

As used herein, ^^a computer system" refers to the hardware 
means , software means and data storage means used to analyse 
the atomic coordinate data of the present Invention. The 
ininimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU) , 
input means, output means and data storage means. Desirably 
monitor is provided to visualise structure data. The data 
storage means may be RAM or means for accessing computer 
readable media of the sixth aspect of the invention. Example 
of such systems are microcomputer workstations available from 
Silicon Graphics Incorporated and Sun Microsystems running 
Unix based, Windows NT or IBM OS/2 operating systems. 
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In a further aspect^ the invention provides a method of 
analysing a complex of PS and a potential modulator comprising 
the step of employing (i) X-ray crystallographic diffraction 
data from the complex and (ii) a three-dimensional structure 
5 of PS^ or at least one sub-domain thereof, to generate a 

difference Fourier electron density map of the complex, the 
three-dimensional structure being defined by atomic coordinate 
data according to Table 1. The difference Fourier electron 
density map may then be analysed. 

10 

■Therefore, such complexes can be crystallised and analysed 
using X-ray diffraction methods, e.g. according to the 
approach described by Greer et al., J. of Medicinal Chemistry, 
Vol. 37, (1994), 1035-1054, and difference Fourier electron 
15 density maps can be calculated based on X-ray diffraction 
patterns of soaked or co-crystallised PS and the solved 
structure of uncomplexed PS. These maps can then be analysed 
e.g. to determine whether and where a particular ligand binds 
to PS and/or changes the conformation of PS. 

20 

Electron density maps can be calculated using programs such as 
those from the CCP4 computing package (Collaborative 
Computational Project 4. The CCP4 Suite: Programs for Protein 
Crystallography, Acta Crystallographicar D50, (1994), 760- 
25 763.) . For map visualisation and model building programs such 
as (Jones et al-, Acta Crystallograhy, A47, (1991), 110- 

119) ca.n be used. 

PS Structural Characterization 

30 

We have found that the structure of a PS monomer consists of 
two major domains, joined at about residue 176 (Figs. 2a-c) . 
Domain N (so called because it contains the N terminal) has an 
alpha-beta-alpha architecture; six parallel p-strands with 1»- 
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3-2-1-4-5 topology alternate with a-helices to form a Rossman 
fold with central p-sheet sandwiched between two layers of a- 
helices (Fig. 2b). The helices (al',1, 2, 3 and 4) pack 
against the p-sheet in a right-handed way. The secondary 
structural elements have been numbered in Figs. 2a and b, with 
elements that are insertions or additions to the ^'standard" 
nucleotide-binding Rossman fold (discussed below under 
^"Identification of Likely Active Sites'') denoted by primes. 
Strand P5 leads directly into the short (3-hairpin and 3io helix 
motif (P6, P7 and Eio7), which lies at the head of domain C 
(containing the C teimiinal) and is likely to be involved in 
phosphate binding (see below) . The rest of the domain has ,a 
simple two-layer organisation: a helix-turn-helix layered . 
above a flat sheet of three anti-parallel p-strands {a8 an(d 
piO-12) • This sheet faces a prominent cleft in domain N , tiie^?:-- 
predicted catalytic region (see below) , making the whole 
structure resemble somewhat a pot (domain N) with its lid 
(domain C) on a hinge, a common arrangement in two-domain 
enzymes . 

We have also found that the two monomers, A and.©., of PS are .: 
related by a non-crystallographic quasi 2-fold rotational, 
symmetry (NCS) axis- The dimerisation interface has a surface 
area of 1340 A^ and the core of the interface is shown in Fig. 
3. The centre of the nearly symmetrical dimerisation 
interface is unusual: below a 2-strand p-sheet (pD from A and 
B) Vall09, Metl66 and Phel68 form a hydrophobic pocket around 
weakly H-bonded polar clusters of Serl35, conserved Asnl39 and 
three water molecules, one of which lies on the NCS axis. 
Above the (5-sheet TyrlOS, AspllO and Argl28 form a tight 
charged cluster, and the rest of the interface consists of 
salt bridges (Hisl06 to Aspl65; Argil to Aspl69) and extensive 
water-mediated H-bonding interactions. 
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The average B-factor of monomer B is about 4 greater than 
that of A, which on the whole contains fewer disordered 
stretches. Also conformational differences between the 
monomers which can be explained by crystal packing 
5 arrangements are found at residues 173-180 and 187-193. 

For residues B187-193, electron density was poor, and the 
apparent backbone connectivity could not be reconciled with 
stereochemical and Ramachandran constraints. The loop was 

10 eventually modelled using the same residues from monomer A 
(which are well ordered) , and transformed by the operation 
that superimposes domain C of monomer A onto monomer B. 
However, it is likely that residues A187-193 are only ordered 
because the bottom of the dimerisation region is 

15 crystallographically packed tightly against this region and 
that the disordered seen in B is more realistic for the apo-* 
enzyme in vivo. 

Residues 239-244 also have entirely different but defined 
20 backbone conformations in the two monomers, and this 

difference is not readily explained by crystal packing. 
However, there appears to be no functional significance in the 
anomaly . 

25 Solving the PS Crystal Structure 

To solve the PS crystal structure, molecular replacement was 
not possible because prior to our determination of the PS 
structure similarities between the amino acid sequence of 
30 E.coli PS and that of proteins with known structures were not 
evident. Therefore, phase information needed to be obtained 
ab initio. 

The phase problem was first approached by the Multiple 
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Isomorphous Replacement technique, and crystals of PS were 
soaked with a range of heavy atom salts at a range of 
concentrations- However, the majority of these conditions 
resulted in crystal damage. 

Eventually, production of selenomethionine PS (SeMet PS) was 
attempted, the selenium atoms being introduced into the 
protein prior to crystallisation by recombinant production of 
the protein in the presence of L-selenomethionine . This was 
successfully accomplished and is discussed in more detail 
below. X-ray analysis was performed on PS and SeMet PS 
crystals. 

1, Production and purification of PS . . i^w: 
Natisre PS^ ■ ''^ 

DNA encoding the PanC gene was engineered into a pDC19 
expression vector. E.coli cells were transformed using the - 
plasmid. 

Colonies of transformed cells were inoculated directly into LB 
medium containing ampicillin (100 mg/ml) and IPTG (70mg/ml) ; 
induction of expression was continuous. The cultures were 
shaken (200 rpm on an orbital shaker) overnight at 37 'c^ when 
the cells were retrieved by centrifugation of the culture 
medium and the cell pellet stored frozen at -80*C. 

Selenomethionine PS 

The same E.coli strain was used as for native expression, but 
the methionine pathway inhibition system (see van Duyne et 
al., J. Mol. Biol. r 229, (1993), 105-124) was used for 
selenomethionine incorporation. Cells were grown on a 
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minimal/ defined medium (see Table 2) containing 
selenomethionine as well as six other amino acids,, whose 
presence inhibits the natural pathways for methionine 
synthesis. A starter culture (100ml) of the same medium as 
5 above, but without selenomethionine or the inhibitory amino 

acids, was inoculated with transformed cells and grown at 37 'c 
to log growth phase . 1ml of this culture was used to 
inoculate baffled 2/Erlemeyer flasks (250ml complete medium 
per flask) which were shaken at 37°C overnight and harvested as 
10 for native protein. 

Purification 

Harvested cells were suspended in 20-4 0ml TD buffer (50mM 
15 Tris/HCl pH 7.5 + 0 . ImM dithiothreitol) sonicated at maximum 
intensity for 8 times 15 seconds, with 15 second breaks, and 
cell debris removed by centrif ugation (30 minutes, 15000x g) . 

The supernatant was stirred at 4*^0 while (NH4)2S04 was added 
20 slowly over ca. 15 minutes to a final concentration of 29.1% 
(w/v) ; after a further 30 minutes of stirring, precipitated 
contaminants were removed by centrifugation (30 minutes, 
1500 Ox g) . The solution was dialysed overnight against TD 
buffer (at least 21). 

25 

The dialysed protein solution was loaded at 4"c onto an anion 
exchange column (Pharmacia Q-Sepharose, 16/10) and eluted with 
TD buffer against a NaCl gradient of 0 to 500mM in 75 minutes^, 
at a flow rate of 5ml/min. The protein eluted between 0.21 
30 and 0.24M NaCl. The protein-containing fractions were 

selected from SDS-PAGE analysis, and concentrated to ca 1ml, 

The concentrated fractions were loaded at 4'c onto a size 
exclusion coluian (Pharmacia S200HR) , and eluted with TD buffer 
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containing NaCl at 500mM. The fractions containing PS were 
confirmed by SDS-PAGE analysis. The fractions were pooled and 
dialysed overnight against TD buffer (at least 21) . 

The dialysed protein solution was loaded at rooin temperature 
onto an affinity column (Pharmacia Blue Sepharose HiLoad 
16/10) and eluted with at least five column volumes of TD 
buffer containing lOmM ATP. This effectively eluted all the 
protein, although this was not monitored directly. . 

ATP was removed from the eluant by repeated cycles (at least 
5) of concentration (in a stirred cell concentrator (Amicon7 
Ultrafiltration Ceil) under pressure in an N2 atmosphere) and 
dilution with TD buffer; ATP content was monitored by the .UV 
spectrum (220-300nm) of the solution. The protein was fina-lly:' 
concentrated (Ultrafree7 concentrator) to a concentration ofe. 
between 20 and 30.mg/ml. At this concentration, the solution 
could be aliquoted and frozen directly at -80°C without damage 
to the:^:5)rotein. - ^v. 

For the purification of the SeMet protein, some.. precautions • 
were taken to minimise oxidation of the selenium in the 
protein. The DTT concentr.ation in all buffers was raised to 
5mM, all buffers were thoroughly purged with N2 gas before use 
and the whole procedure was completed as fast as possible 
within two days. The SeMet preparations of PS were subjected 
to Elect rospr ay Mass Spectrometry (ESMS) to confirm the 
incorporation of selenomethionine during the expression. 

2. Preparation of Crystals. 

Crystals of PS and SeMet PS were grown using the hanging drop 
vapour diffusion method. Protein (20mg/ml) was mixed on a 1:1 

ratio with crystallisation solution containing 4-7% (w/w) 

I 
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Polyethylene Glycol 4000 and 50mM Tris/HCl buffer at pH8 . 
Crystals formed within 2-4 days at 19**C, Crystallisation of 
SeMet PS, was performed using a nearly identical protocol, but 
additionally, 2mM DTT was added to the crystallisation 
5 solution before mixing the drop. 

Crystals ideally have approximate dimensions of 600x200x50 pm. 
Under non-optimal conditions, crystals grow in clusters and 
are generally much thinner in the 3^^ dimension (10-20 ]junri) . 

10 

Crystals of PS were cryo-protected using a protocol of gradual 
soaking in the cryo-protectant, glycerol. A crystal was 
placed in 20ul of crystallisation solution, and the 
concentration of glycerol is gradually increased to 28% (v/v) 
15 in 4% increments. 

3 . Structural Determination 

Multi-wavelength data sets were collected from a cryo-cooled 
20 crystal of SeMet PS, on beam line X-25 of the NSLS at 

Brookhaven National Laboratories on Long Island, USA. This is 
a high-flux station with good intensity and wavelength 
stability. The presence of selenomethionine in the protein 
was confirmed independently by electrospray mass spectrometry. 
25 Before the experiment, a large number of crystals were 

extensively screened for highest resolution, low mosaicity and 
low background scatter. 

Terminal radiation-induced diffraction decay was evident in 
30 the first crystal to be exposed, which influenced data 
collection from the second, final SeMet crystal. 

In addition to the three data sets collected from SeMet 
crystals, a data set was collected from a large native 
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crystal, which had been established to be nearly isomorphous 
with the SeMet crystals used- In order to have complete but 
also high resolution data, the same oscillation range was 
exposed twice, the first for measuring low resolution data 
(i.e. short exposures), and the second for the highest 
resolution possible (long exposures) . All data were processed 
using MOSFLM (Leslie, Joint CCP4 and EESF-EACMB Newsletter on 
Protein Crystallography, Vol .26., Daresbury Laboratory, UK) and 
scaled with SCALA (Collaborative Computational Project 4. The 
CCP4 Suite: Programs for Protein Crystallography, Acta 
CrystallographicBr D50, (1994) , 760-763) . 

The selenium atoms were located using the program SnB. (Weeks^ ::. 
et al., J. of Applied Crystallography, 32, (1999), 120-124^;. . 
and their positions refined using SHARP (LaFortelle et al.. 
Methods in Enzymologyr 216, (1997), 472-494 and LaFortelle et 
al,,. Maximum Likelihood Refinement in a. Graphical environmenrt., 
with SHARP, in CCP4 study week-end: Recent Advances in 
Phasingr ed. Wilson et al., Daresbury Laboratory, UK). The 
final model contained 19 selenium sites which were used to . 
provide initial phasing. Solvent flattening and phase 
extension techniques were used to produce an interpretable 
electron density map. 



The program O was used for model building. The experimental, 
solvent flattened electron density map was readily 
interpretable and secondary structural elements were clearly 
defined in the electron density bones (calculated with MAPMAN, 
see Kleywegt et al.. Acta Crystallographica, D52, (1996b), 
82 6-828) . The main chain of one monomer could be traced 
nearly continuously, using the secondary structure template 
building functionality in O, and the selenium atoms identified 
using SHARP providing guidance for chain^tracing. 
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The complete main chain model of monomer A was manually 
rotated to correspond with the bones of the second monomer 
(B) . Since the relative orientation of the two domains was 
slightly different in monomer B, it was optimised by rigid 
5 body refinement (using REFMAC, see Murshudov et al.. Acta 

CrystallographlcBr D53, (199.7), 24-255), keeping separate, the 
two domains (residues 1-176 and 177-283) . 

The model was improved by three iterated cycles of restrained 
10 and individual isotropic maximum likelihood refinement with 

REFMAC (40--1.7D resolution) together with manual rebuilding in 
0. OA-weighted 2Fobs-Fcaic and Fobs-Fcaic maps were used (Read, 
Acta Crystallographicar A42, (1986), 140-149), the former 
frequently informative even when contoured at only 0.8-0.9 map 
15, standard deviations. For difficult parts of the model, maps 
and models resulting from simulated annealing in CNS (Brunger 
et al.. Acta Crystallographicar D54, (1998), 905-921) were 
also considered. Ordered water molecules were modelled by 
automated cycles of water addition and removal by ARP 
20 (Perrakis et al.. Acta Crystallographica, D55, (1999), 1765- 
1770) and refinement by REFMAC, with a final cycle of 
refinement with bulk solvent correction using CNS to ensure 
good geometry. 

25 The final model consists of 4290 non-hydrogen protein atoms, 
and 384 water molecules. All residues were modelled, but 
electron density was poor for C-terminal residues (A283, B282- 
3)., as well as residues B187-193; the B-factors of these 
residues are high, approaching 80A^. Residues A251-259, B63- 

30 68 and B251-259, though visible, are also not well ordered and 
have B-factors approaching 60A^. Two residues (A4 and A273) 
have alternative conformations, and 12 surface-exposed side 
chains are ' disordered and were modelled as the most common 
rotamer at zero occupancy. 
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Table 3 provides model parameters and refinement statistics 
for a version of the model which is essentially the same as 
that of Table 1 but contains more water molecules and also two 
ethanediol molecules and a Tris molecule. Residues B188-192 
of this version of the model were reconstructed using BUSTER 
(Bricogne, Methods In Enzymology, 276, (1997), 361-423) in its 
implementation with TNT (Tronrud, Methods in Enzymology r 211 , 
(1997), 306-319) instead of by the symmetry operation 
described above under ''PS Structural Characterization". The 
program DDQ (van den Akker et al.. Acta Crystallographicar 
D55, (1999), 206-218) was used to assess local and global 
accuracy and satisfactory completion of refinement, by 
considering difference density peaks arising from the final. ivi.; 
model. OA-weighted difference maps ..were calculated in REEmeVv- 
excluding water molecules from the model. Quality of the 
model and its geometry were assessed by OOPS (Kleywegt . et al:> r 
OOPS-a-daisy, CCP4/ESF-EACBM Newsletter on Protein \ I., 

Crystallography, 30, (1994), 20-24), PROCHECK (Laskowski et 
al., J. Applied Crystallography, 26, (1993), 283-291) and 
WHATCHECK. (Hooft et al.. Nature, 381, (1996), 27.2) . No 
serious deviations from expected values are present, and 
warnings either correspond to well-defined justifiable 
features or else poorly-visible features that have high B- 
f actors anyway. There are no Ramachandran outliers, and 92.2% 
of residues lie in most favoured regions of the plot. 

Identification of Likely Active Sites 

Having solved the PS crystal structure it is now evident that 
in terms of their Ca coordinates, the ATP-binding domains of 
(i) class I amino-acid tRNA synthetases (tRS) (i-e. EtRS from 
Thermus thermophilus , Nureki et al.. Science, 267, (1995) 
1958-1965; QtRS from £. coli, Perona et al.. Biochemistry, 32, 



wo 02/22793 



PCT/GB01/04a67 



-28- 

(1993.) 8758-8771; MtRS from Thermus aquations ^ Mechulam et 
al., J. of Molecular Biology r 294, (1999), 1287-1297; and YtRS 
from Bacillus stearothermophilus, Brick et al., J. of 
Molecular Biology, 208, (1989), 83-98), (ii) 
phosphopantetheine adenylyltransferase (PPAT) from E. coli 

(Izard et al., EMBO Journal, 18, (1999), 2021-2030) and (iii) 
CTP:glycerol-3-phosphate cytidylyltransf erase (CGT) from B. 
subtilis (Weber et al.. Structure with Folding and Design, 1, 

(1999), 1113-1124) are structurally similar to domain N of PS. 

More specifically, the particular class of Rossman fold which 
characterises tRS, CGT and PPAT consists of five p-strands in 
a central sheet and a cleft between p-strands (il and p4 at the 
adenosine-binding site (see Fig. 2c) . PS also has these 
features. In addition, in all four cases strand (35 is 
followed by catalytically important residues which form the 
KMSKS motif discussed below) , and for both PS and tRS strand 
|35 leads directly into the next domain. 

Furthermore, two sequence motifs, HIGH and KMSKS (Barker, et 
al., FEES Letters, 145, (1982), 191-193), are conserved in tRS 
proteins and also in the wider super family. From mutational 
studies (First et al., in Biochemistry, 32, (1993), 13644- 
13663) these motifs are known to be involved in ATP binding: 
the HIGH motif, binds the adenine portion of ATP (cytidine in 
GGT) and the KMSKS motif stabilises the (3- and y^phosphate 
groups. These motifs are also found in PS and correspond 
respectively to residues 34-37 and 185-189. 

The location of the bound ATP adenine in the structure of QtRS 
corresponds to within 2 to 3D of the positions of the bound 
nucleotides in YtRS, PPAT and CGT, i.e. in the cleft between 
strands pi and P4 of the Rossman fold and against the top of 
helix al (the location of the HIGH motif) . When this domain 
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of QtRS is aligned with domain N of PS the HIGH (actually HDGH 
in PS) residues line up very well and the QtRS -bound ATP fits 
nearly perfectly into the same cleft in PS. Despite this 
excellent match, there is a difference in the positions of the 
5 helices EiqV (in PS) and al (in QtRS) relative to the Rossman 
domain. This is the location of the KMSKS motif. However, by 
changing conservatively the cp/n-angles of residues Vall75, 
Prol76, Ilel77 and Metl78 which form the PS inter-domain 
linker main chain, domain C can be rotated sufficiently to 
10 align the KMSKS residues with their QtRS counterparts and thus 
involve them in phosphate binding. 

Fig. 4 shows a Connolly surface generated around the propo*sed>. 
PS active sites. The skilled person would immediately ^ -.v: 
15 recognise that residues shown in the figure would be involved>>'r 
in -.respective interactions with Mg^"*"/ .ATP, pantoate and p- 
alanine V. . •;sv=- 

The Connolly surface opens besides, the ATP ribose group and 
20 the walls are formed by. fully conserved residues, which are . 
largely hydrophobic but include some polar groups;.. The 
catalytically essential Mg^**" ion is shown at its most likely 
position where it is bound to OGseriesf 0HTyr7i> O^^tp^ and O^^tp* 
This is also the proposed Mg^"*" binding position in PPAT. 
25 Slightly more speculatively/ the most favourable conformer of 
pantoate is shown positioned in a cavity where it appears to 
satisfy the hydrophobic, and hydrogen-bonding interactions of 
the substrate, as well as being suitably positioned for attack 
on ATP. 
30 ' 

Binding positions for p-alanine may also be proposed^ but with 
less certainty than the binding positions of ATP and pantoate. 
For example, the p-alanine carboxylase may "bind in a 
conserved, positively charged pocket to Argl23, with MetSO, 
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Phe62 and Tyr71 providing a hydrophobic patch to accommodate 
the two (3-alanine methylene groups^ and His 12 6 being suitably 
positioned to deprotonate the NH3'*' group. 

5 A list of the residues which line the binding pockets is 

provided in Table 4 . Some or all of these residues may be used 
to model PS active sites in the various aspects of the 
invention discussed above. 

10 Structure-Based Drug Design 

Determination of the 3D structure of PS provides important 
information about the likely active sites of PS;^ particularly 
when comparisons are made with similar enzymes. This 

15 information may then be used for rational design of PS 

inhibitors, e.g. by computational techniques which identify 
possible binding ligands for the active sites, by enabling 
linked-f ragment approaches to drug design, and by enabling the 
identification and location of bound ligands using X^ray 

20 crystallographic analysis. These techniques are discussed in 
more detail below. 

Greer et al. mentioned above describes an iterative approach 
to ligand design based on repeated sequences of computer 

25 modelling, protein-ligand complex formation and X-ray 

analysis. Thus novel thymidylate synthase inhibitor series 
were designed de novo by Greer et al., and PS inhibitors may 
also be designed in the this way. More specifically, using 
e.g. GRID on the solved 3D structure of PS, a ligand (e.g. a 

30 potential inhibitor) for PS may be designed that complements 
the functionalities of the PS active site(s). . The ligand can 
then be synthesised, formed into a complex with PS, and the 
complex then analysed by X-ray crystallography to identify the 
actual position of the bound ligand. The structure and/ or 
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functional groups of the ligand can then be adjusted, if 
necessary, in view of the. results of the X-ray analysis, and 
the synthesis and analysis sequence repeated until an 
optimised ligand is obtained. Related approaches to 
5 structure-based drug design are also discussed in Bohacek et 
al.. Medicinal Research Reviews, Vol.16, (1996), 3-50. 

As a result of the determination of the PS 3D structure,- more 
purely computational techniques for rational drug design may 

10 also be used to design PS inhibitors (for an overview of these 
techniques see e.g. Walters et al. mentioned above).. For 
example, automated ligand^receptor docking programs (discussed 
$,g. by Jones et al. in Current Opinion in Biotechnology / 
Vol.6, (1995), 652-656) which require accurate information^ on- 

15 the atomic coordinates of target receptors may be used to - re - 
design potential PS inhibitors. - 

Linked-f ragment approaches to drug design also require 
accurate information on the atomic coordinates of target 

20 receptors. The basic idea behind these approaches is to 
determine (computationally or experimentally) the binding 
locations of plural ligands to a target molecule, and then 
construct a molecular scaffold to connect the ligands together 
in such a way that their rela^ binding positions are 

25 preserved. The ligands may be provided computationally and 

modelled in a computer system, or provided in an experimental 
setting, wherein crystals according to the invention are 
provided and a plurality of ligands soaked separately or in 
mixed pools into the crystal prior to X-ray analysis and 

30 determination of their location. 



The binding site of two or more ligands are determined and may 
be connected to form a potential lead compound that can be 
further refined using e.g. the iterative technique of Greer et 
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al. For a virtual linked- fragment approach see Verlinde et 
al.^ J. of Computer-Aided Molecular Design, 6r (1992), 131- 
147, and for NMR and X-ray approaches see Shuker et al,, 
SciencBr 274, (1996), 1531-1534 and Stout et al., Structure, 
5 6,. (1998), 839-848. The use of these- approaches to design PS 
inhibitors is made possible by the determination of the PS 
structure . 

Many of the techniques and approaches to structure-based drug 
10 design described above rely at some stage on X-ray analysis to 
. identify the binding position of a ligand in a ligand-protein 
complex. A common way of doing this is to perform X-ray 
crystallography on the complex, produce a difference Fourier 
electron density map, and associate a particular pattern of 
15 electron density with the ligand. However, in order to 
produce the map (as explained e.g. by Blundell et al. 
mentioned above) it is necessary to know beforehand the 
protein 3D structure (or at least the protein structure 
factors) . Therefore, determination of the PS structure also 
20 allows difference Fourier electron density maps of PS-ligand 

complexes to be produced, which can greatly assist the process 
of rational drug design.. 

The approaches to structure-based drug design described, above 
25 all require initial identification of possible compounds for 
interaction with target bio-molecule (in this case PS) . 
Sometimes these compounds are known e.g.. from the research 
literature. However, when they are not, or. when novel 
compounds are wanted, a first stage of the drug design program 
30 may involve computer-based in silico screening of compound 

databases (such as the Cambridge Structural Database) with the 
aim of identifying compounds which interact with the active 
site or sites of the target bio-molecule. Screening selection 
criteria may be based on pharmacokinetic properties such as 
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metabolic stability and toxicity. However, determination of 
the PS structure allows the architecture and chemical nature 
of each PS active site to be identified, which in turn allows 
the geometric and functional constraints of a descriptor for 
5 the potential modulator of PS activity to be derived. The 

descriptor is, therefore, a type of virtual 3-D pharmacophore, 
which can also be used as selection criteria or filter for 
database screening. 

10 Compounds which have a chemical structure selected using the 
methods of the invention, described herein, wherein said 
compounds are, PS modulators, form. a further aspect of the 
invention. Such compounds may be used in methods of medica^B^-i: 
treatments, such as in the treatment of bacterial inf ectibnSi^?:->C' 

15 in the human or animal, body. The compounds may be used alone-^ ■ 
or in conjunction with other anti-bacterial ..compounds to . • . 
enhance - their effect. . .'^"^^ 

While the invention has been described in conjunction with the 
20 exemplary embodiments described above, many equivalent 

modifications and variations will be apparent, to, those skilled; 
in the art when given this disclosure. Accordingly, the 
exemplary embodiments of the invention set forth are 
considered to be illustrative and not limiting. Va.rious 
25 changes to the described embodiments may be made without 
departing from the spirit and scope of the invention. 
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Table 2 



Composition of defined xninimal culture medium for selenium-containing PS. All components were filter- 
sterilized through 0.22|im filters, except where indicated. 



Compound 


Stock cone. 


Volume 


Comments 


- M9 medium' 


V 


250 ml 


Autoclaved. 




1 IVl 


250 \A 


Autoclaved separately firom M9 medium to avoid 
precipitation. 


D-glucose 


4%w/y 


25 ml 


Not autoclaved, since that caused glucose to 
caramelize (yellow colour); filter sterilized instead. 


Thiamine 


0.5% w/v 


25^1 


Prepared: stock and stored at -2p*'C; since repeated 
cycl es of freeze and thaw do not damage it 


FeSO^ 


4.2 gA 


250 jxl 


Prepared stock and stored at -r20°C, to prevent 
oxidation. 


Ampicillin 


lOOing/ml 


250- fil 


Filter sterilized and stored as aliquots — cycles of 
freeze and thaw were avoided. \ 




70mg/ml 


250 ^1 




L-arginine 


2.53% w/v 


-> mi 


CiinnlAmonfwf 'fi^r- ATI '^71 il^'firipnr'V nrpnarAfl 
ouppicilliSuLCU xUl xxX-X J / J, ucxLuicuyy y .^icpoicu 

togetheras single stock. 


L-histidine 


0:31% Wv ;-^^ 






L-proline 


4.6% w/v 




' - ■ ' -r- = ■. ■ - : - 


Adenine 


1.35% w/v .. 






L-iysine 


12.5 g/1 


2 ml 


Cocktail for methionine pathway inhibition; prepared 
as one stock. Final conc^trations were 100 arid 
50 mg/l respectively." 


L-phenylalanine 


12.5 g/1 






L-threonine 


12.5 gA 






L-isoleucine 


6.25 gA 






L-leucine 


6.25 g/1 






L-valine 


6.25 g/l 






L-seleno-methionine 


Final cone: 50 mgA 




No need to sterilise, to minimise risk of pxidation. 
Dissolved in water directly in botde in which 
supplied, then added. 



■ Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989). Molecular cloning: a laboratory manual, 2nd ed. . 
Cold Spring Harbour Laboratory Press, Cold Spring Harbor, N.Y. 
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Crystallographic refinement 


No. reflexions (test set) 
No. restraints 
No. parameters 

Weight for geom. restraints (T/VT) 


77 294 (4062) Test set is excluded from refinement for cross-validation 
15 730 Restraints in TNT with a weight assigned 
20236 

3 • 


Final model parameters 


Residues 
Hetero 

No. water molecules 
No. non-hydrogen atoms 
Resolution range (A) 


566 

1 Tris, 2 ethanedio! 

622 

5059 

45-1.7 


Refinement convergence 


^free 
/^factor 

DDQ (score, ranking) UFO 

DDQ-R 

Average jB-factor, subunit A (A^) 
subunil B (A^) 
waters (A^) 

Wilson distribution Bfacmr (A^) 


24.9 /?£nor calculated using test reflexions 

22.6 /^faaor = ^hHFobsI - iFcafcil / ^hlFobsi. w/o tcst rcflexions. 

0.7 1 (bottom 25 %) "Unassigned positive Feature left-Over score" 

1 5.2 (bottom 25%) Ratio of Shift and Water peak contributions. 

33.9 

36.4 

47.8 ' ' 

28.0 


Model quality 


Ramachandran plot 

Rms deviation from ideal 
. ("root mean square") 

Procheck criteria 
WhatCheck criteria 


% residues in most favoured region . 92.2 
% residues in generously-allowed region 7.4 
No. residues in disallowed region 0 
Covalent bond lengths (A) .0.018 
Bond angles n 1.41 
Planar groups (A) 0.007 
% bond lengths outside expected limits 2.6 
% bond angles outside expected limits 3.1 
' % planar groups outside expected limits 1 .0 
No. unsaturated H-bonds 2 
No. residues in unusual environments 14 
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Table '4 " 

Residues lining the PS binding pockets 



Val27 




Pro28 . 




Thr29 




Met30 




Gly31 




Asn32 




Leu33 - 




His34 




Asp35 




Gly36 




His37 




Lys39 




Leai40 




Ser54 




Phe56 




AsnSB . 




Gln6.1 . 




Phe62 


• " 


Tyr71 




•Phe91 •■ 




Pro93 




Xle98 




LeullB 




Glull9 




His 12 6 




Phea27 




Vall30 




ilel33 




Vall34 




Lysl36 




Leul37. . 




Leul40 




Cysl47 




Phel48 




Glyl49 




GlulSO 




LyslSl 




Aspl52 




Phel53 




.Ginl54 




Glnl55 : 




Leul56 




Ilel59 




Ilel72 




Vall75 




Prol76 




Ilel77 




Metl78 




Argl79 




Leul84 




Alal85 




Leul8 6 




Serl87 




SerlB8 




Argl89 




Asnl90 




Asp242 




Leu251 




Leu264 




Val265 




Ala266 




Arg273 




Leu274 




Ile275 




Asp27 6 




Asn277 
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Claims ; 

1. A crystal of pantothenate synthetase (PS) having a 
monoclinic space group P2i and unit cell dimensions of a = • 
66.0±0.2A, b = 78.1±0.2A, c = 77.110.2A and p = 103.7±0.2°. 

2. A crystal of PS having the three dimensional atomic 
coordinates of Table 1. 

3. A method for crystallizing a selenium atom PS derivative 
which comprises producing PS by recombinant production in a 
bacterial host in the presence of selenomethionine, recovering 
a selenium atom PS derivative from the host and growing 
crystals from the recovered selenium atom PS derivative. 

4. A computer-based method of rational drug design which 
comprises: 

providing the structure of the PS as defined by the 
coordinates of Table 1; 

providing the structure of a candidate modulator 
molecule; and 

fitting the structure of the candidate modulator molecule 
to the structure of the PS of Table_l.... 

5. A computer-based method of rational drug design which 
comprises: 

providing the coordinates of at . least two atoms of the PS of 
Table 1; . 

providing the structure of a candidate modulator molecule; 



and 



fitting the structure of the candidate modulator molecule 
to the provided coordinates of the PS. 

6. A computer-based method of rational drug design which 
comprises: 
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providing the coordinates' of' at "le'E'st "'a sub-doinain "of " tKe 
PS; . 

providing the structure of a candidate modulator molecule; 

and 

. fitting the structure of the candidate modulator molecule^ 
to the coordinates of the PS sub-domain provided. 

7 . The method of any one of claims 4 to 6 which further 
comprises the steps of: 

obtaining or synthesising the candidate modulator; and 
contacting the candidate modulator with PS to determine 
the ability of the candidate modulator to interact with PS. 

8 . The method of any one of claims 4 to 6 which further 
comprises the steps of : 

obtaining or synthesising said candidate modulator; 

fprming a complex of PS and said potential modulator; .and. 

analysing said complex by X-ray crystallography to 
determine . the ability of said candidate modulator to interact 
with PS. . 

9. A compound having a chemical structure selected using the 
method of any one of claims 4 to 6^ said compound being an 
inhibitor of PS. 

10. A computer readable medium with either (a) atomic data 
according to Table 1 recorded thereon, said data defining the 
three-dimensional structure of PS or at least one sub-domain 
thereof or (b) structure factor data for PS recorded thereon, 
the structure factor data being derivable from the atomic 
coordinate data of Table 1. 

11. A computer system, intended to generate structures and/or 
perform rational drug design for PS or complexes of PS with a 
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potentia-1 modulator, the -system- containing either "(a) atomic' " 
coordinate data according to Table 1, said data defining. the 
three-dimensional structure of PS or at least one sub-domain 
thereof or (b) structure factor data for PS, said structure 
factor data being derivable from the atomic coordinate data of 
Table 1. 
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Fig. 4 
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